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ABSTRACT 

The Kepler spacecraft observes a host of target stars to detect transiting planets. Requiring 
a 7.1 sigma detection in twelve quarters of data yields over 100,000 detections, many of which 
are false alarms. After a second cut is made on a robust detection statistic (|12l ). some 50,000 
or more targets still remain. These false alarms waste resources as they propagate through the 
remainder of the software pipeline and so a method to discriminate against them is crucial in 
maintaining the desired sensitivity to true events. This paper describes a \ 2 test which represents 
a novel application of the formalism developed by Allen (|6|) for false alarm mitigation in searches 
for gravitational waves. Using this technique, the false alarm rate can be lowered to ~ 5%. 



Subject headings: methods: statistical 

1. Introduction 

The Kepler spacecraft continuously observes 
more than 150,000 target stars in a 115 square- 
degree held of view to discover Earth-like planets 
transiting Sun-like stars through analysis of pho- 
tometric data (0; 0)- The Kepler spacecraft col- 
lects photometric data for each target star which 
is compressed and stored on-board to be down- 
linked at monthly intervals. The Kepler Science 
Operations Center at NASA Ames Research Cen- 
ter processes the data with the Science Processing 
Pipeline, which is composed of several modules in- 
cluding the Transiting Planet Search (TPS) (0). 
To search for transit signatures, TPS employs a 
bank of wavelet-based matched filters that form 
a grid on a three dimensional parameter space of 
transit duration, period, and epoch <d S). Ow- 
ing to non-stationary and non-Gaussian noise, un- 
corrected systematics, and poorly mitigated noise 
events of either astrophysical or non-astrophysical 
nature, large spurious Threshold Crossing Events 
(TCEs) can be produced by the matched filtering 
performed in TPS. 
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The optimal linear filter for a deterministic sig- 
nal buried in Gaussian noise is a simple matched 
filter (0). The output of a matched filter can be 
large, both, when a true signal is contained in the 
data and when the noise mimmicks the true sig- 
nal closely enough. In a perfect detection scenario 
the noise would be stationary and Gassian, fur- 
nishing static false alarm and detection probabil- 
ities. Under this scenario, one can easily apply 
the Neyman-Pearson criterion to set a detection 
threshold that maximizes the detection probabil- 
ity for a given, tolerable, false alarm rate. In re- 
ality however, a plethora of outside influences can 
contribute non-stationary and non- Gaussian noise 
to the data stream. An effort can and should 
be made to understand as many of these periph- 
ereal noise sources as possible and remove their 
effect through model fitting. Imperfections in the 
removal of known noise sources and other spu- 
rious noise events may then however necessitate 
the formulation of a discrimination strategy for 
the remaining false alarms that contribute to a 
false alarm rate that is above what is expected 
from consideration of Gaussian statistics alone. 
The chosen discrimination method should have as 
a requirement to preserve the existing detection 
probability while simultaneously reducing the false 
alarm rate. True positives should easily pass the 
test, while false positives should be mitigated. 
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A clever approach to this problem was devel- 
oped by Allen Q6f) for use in gravity wave searches 
in LIGO data. In that work, the chief astrophysi- 
cal source of interest was the inspiraling compact 
binary system. The signal, or chirp, from such a 
system is a sinusoid with both a frequency and 
amplitude that diverge as the stars approach the 
merger phase of the inspiral event. The x 2 dis- 
criminator is essentially built by breaking up the 
frequency band of the detector into chunks and 
for each chunk comparing the expected response, 
given the template waveform, to the actual re- 
sponse. Allen (j6|) shows that the expectation value 
of the x 2 statistic is independent of whether or 
not a signal is present in the data. This property 
makes it a good discriminator since its value de- 
viates from the expectation value only when the 
noise deviates from its assumed Gaussianity and 
stationarity. So it provides a method for target- 
ing contributions from non-Gaussian tails that end 
up furnishing TCEs. Allen's work goes on to show 
that under the assumptions of both stationary and 
Gaussian noise, the x 2 statistic can be proven to 
have a x 2 probability density function. The paper 
also shows that when there is mismatch between 
the true signal and the template, the probability 
density function of the x 2 statistic then becomes 
a non-central x 2 distribution whose non-centrality 
parameter depends on both the signal-template 
mismatch and the square of the expected SNR. 

Allen's method was originally formulated for 
use with broad-band deterministic signals, with 
unkown signal parameters, and detectors with po- 
tentially non-ideal observing noise. In the case 
of planet detection, the problem is similar. The 
waveforms used by TPS depend on a set of three 
parameters, namely, epoch, period, and tran- 
sit duration ({to,T, d} respectively). Planetary 
transits produce periodic depressions in the light 
curves of their host stars. The exact shape of the 
depression depends strongly on both stellar and 
planetary parameters. TPS avoids this complica- 
tion by simply using pulse trains of square waves, 
which means that the templates and true signal 
are inherently mismatched. The degree of mis- 
match is furthered by the fact that only a discrete 
set of points in the three dimensional parameter 
space is searched over. The goal of TPS is to 
simply identify those targets that potentially have 
something interesting and should be followed up 



with the final Pipeline module, Data Validation 
(DV). DV then does a much more thorough job of 
fitting an astrophysical transit model to determine 
the validity of the TCE. 

Although the signal detection problems differ 
between the gravity wave detection, for which the 
X 2 formalism was originally developed, and the 
transiting planet detection, the basic ideas can still 
be applied. Rather than breaking up the detec- 
tor's bandwidth into different channels, we instead 
analyze the components of our test statistics in 
multiple ways that are completely analogous with 
the x 2 formalism developed by Allen. Many of the 
results can then be directly applied. 

This paper is organized as follows. Section [5] 
reproduces the detection theory employed by TPS 
for completeness. Section [3] then applies the x 2 
formalism to produce a discriminator for TPS and 
discusses its expected distribution. Section H] dis- 
cusses some of the other possible versions of the x 2 
discriminator. Section[5]shows what the effect of a 
mismatch between the signal and template has on 
the x 2 statistic. In this section the true mismatch 
is estimated by a Monte Carlo study which also 
proves to be useful for generating an astrophys- 
ically motivated set of templates for TPS. Sec- 
tion [6] shows how TPS uses the discriminators for 
thresholding purposes now and and how it might 
use them in the future. Section [7] gives some ex- 
amples that illustrate how the x 2 discriminators 
work and what their strengths and weaknesses are. 
Section [8] gives some results based on analysis of 
twelve quarters of Kepler data. Then Section |9] 
gives a conclusion which summarizes the main re- 
sults, gives prospects for future work, followed by 
acknowledgments . 

2. Detection Theory 

The data input to TPS are discrete, contiguous, 
flux fraction time series that have been corrected 
for systematics and had some other more localized 
noise artifacts removed such as sudden pixel sen- 
sitivity dropouts, cosmic rays, and thermal tran- 
sients. For a discussion of how the data are pre- 
pared for the search in TPS see (|l2l ). Let x(n) be 
this discrete, contiguous, flux fraction time series, 
where n £ [1,...,N]. Under the null hypothesis, 
HO, there is no transit signal present and we have 
only noise w(n) which we assume to be zero-mean, 
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White Gaussian Noise (WGN), with variance a 2 . 
Under the alternative hypothesis, HI, there is a 
transit pulse signal s(n) present in the data (for 
simplicity, assume for now a single pulse is present 
rather than a pulse train). We then have: 



HO : x{n) 
HI : x(n) 



w(n) 

w(n) + s(n). 



(1) 



An often used result from detection theory is that 
for a known signal, the optimal detection statistic 
is a simple matched filter of the form 



x ■ s 



(2) 



where we have used bold-faced type to denote vec- 
tor quantities and a is the standard deviation of 
w(n). Since z is a linear combination of Gaus- 
sian random variables, it too is a Gaussian ran- 
dom variable. The mean and variance of z under 
the two hypotheses therefore completely charac- 
terize the detection problem. It is straightforward 
to show that under both hypotheses, the variance 
of z is unity, while the mean under H is zero and 
the mean under HI is the SNR of the signal: 



HO 



0. 



( z )hi 



Zl 



(3) 



(4) 



({z-z ) 2 ) m = {{z- Zl f) H1 

= 1. (5) 

Now it is easy to see that the probability density 
functions under the two hypotheses are given by: 



Vi{x) 



exp[ V x > ], i = {0,1}, (6) 



/2vr 



and the corresponding false alarm and detection 
probabilities as a function of the decision making 
threshold rj are: 



Pfa 



Po(x) dx 



Px(x) dx. 



(7) 
(8) 



In this detection scenario there is no knowledge of 
prior probabilities nor any costs associated with 
false alarms or false dismissals, so the best strategy 
to adopt is to apply the Neyman-Pearson criteria. 
A maximum tolerable false alarm is chosen which 
determines the threshold 77 and the corresponding 
detection probability. 

Since we are observing the light from many tar- 
get stars with highly varied properties, the noise 
w(n) is typically not white, but colored. Colored 
Gaussian noise can be modeled as the result of fil- 
tering white Gaussian noise through a linear but 
possibly time- varying filter (@) . The noise will gen- 
erally then possess an auto-correlation matrix, R, 
with non-zero off-diagonal components. The opti- 
mal detector is still the simple matched filter but 
it must account for the off-diagonal components: 



(9) 



Vs T R" -s 

Since R is an autocorrelation matrix of the noise 
w(n), it is non-singular and symmetric and there- 
fore possesses a square root. So we can rewrite © 
as 

(R- 1 / 2 x) T (R- 1 / 2 s) 



(R- 1 / 2 s) T (R- 1 / 2 s) 



(10) 



where x = R _1 ^ 2 x and s = R _1 ^ 2 s are whitened 
versions of the data and signal vectors. Since R is 
typically not available, one is faced with the dif- 
ficult task of designing an appropriate whitening 
filter to construct the test statistic in the presence 
of colored noise. Since the stellar irradiance can 
clearly exhibit (colored) nonstationary behaviour, 
a wavelet-based adaptive matched filter that con- 
structs a time- varying whitener is employed Q . 
This detector performs a joint time-frequency de- 
composition of the data to estimate the proper- 
ties of the noise as a function of time, then ap- 
plies a matched filter to the whitened data in the 
wavelet domain, taking into account the effect of 
the whitener on the shape of the transit pulse. 

The wavelet based approach employed by TPS 
uses an Over-complete (discrete-time) Wavelet 
Transform (OWT) of the data and template with 
Debauchies' 12-tap wavelets The wavelet do- 
main is a natural choice for designing time- varying 
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filters since it is a joint time-frequency representa- 
tion of the transformed data. The wavelet-based 
matched filter uses an octave-band filter bank to 
separate input flux time series into different band 
passes to estimate the noise Power Spectral Den- 
sity (PSD) as a function of time. For details of 
the filter bank implementation see HI) ■ For our 
purposes here, define the OWT of x(n) as 

W{x(n)} = {aji(n), x 2 (n), x M (n)}, (11) 

where 

Xi {n) = hi(n) * x(n) , i = 1, 2, M, (12) 

'*' denotes convolution, and the hi(n) are the im- 
pulse responses of the filters in the filter bank 
implementation of the wavelet expansion with 
corresponding frequency responses Hi(uj). The 
filter Hi is a high-pass filter that passes fre- 
quency content from half the Nyquist frequency to 
Nyquist ([/ Nyquist A j 'Nyquist])- The next filter, 
H 2 , then passes frequency content in the interval 
[/ 'Nyquist /4, / 'Nyquist /2]. Each subsequent filter 
passes content in the next lower bandpass until 
the final filter, Hm, passes the lowest bandpass 
on down to DC. The time- varying channel vari- 
ance, of, in each channel i of the filter bank is 
estimated by a moving circular median absolute 
deviation (MAD) with an analysis window cho- 
sen to be significantly longer than the duration of 
transit pulse. 

To obtain the wavelet-based expression for the 
matched filter, we need to be able to express 
the dot product in the wavelet domain. For an 
overcomplete, dyadic, wavelet expansion, the dot 
product can be expressed as: 

M 

x.y = ^2- rai ^ M - 1 )x i .y l , (13) 

i=l 

where x and y are time series, and and are 
the wavelet components which are also time se- 
ries (fiol). The restriction on the power of two in 
(fT3| is necessary because the last two channels of 
the OWT have the same bandwidth. The detec- 
tion statistic is now computed by multiplying the 
whitened wavelet coefficients of the data by the 
whitened wavelet coefficients of the transit pulse 



and employing the dot product relation: 
x • s 

z — —j= 

V§ ■ s 

J2iLi 2-"^-D Eti XjjnMny&Kn) 

(14) 

This equation ([M]) is the Single Event Statis- 
tic (SES). It allows us to compute the detection 
statistic for a transit pulse s of a specific duration, 
where the transit is centered at a single point in 
time n. The of (n) are the whitening coefficients 
which are variance estimates for each time point 
and for each wavelet scale i. These are estimated 
by a moving circular Median Absolute Deviation 
(MAD) at each scale with a base window size that 
is thirty times longer than the pulse duration. 

At the outset of our analysis, the timing of a 
real transit signal, namely its epoch and period, is 
unknown. Therefore, it would be useful to gen- 
erate a time series of detection statistics, z(n), 
which represents the detection statistic at sample 
n for a single transit pulse which is centered at 
n. This can be done via N repeated invocations 
of equation (|14[) . in which we recenter the tran- 
sit pulse at all possible times n sequentially. Al- 
though effective, this method of computing z(n) is 
cumbersome, requiring that the wavelet transform 
of equation (|llj) and the dot product in equation 
(TT4")) be performed for each of the N unique tran- 
sit pulse models s. An equivalent result can be 
achieved more efficiently by recognizing that the 
mass of invocations of equation (|14p is equivalent 
to computing a cross correlation, in which the ar- 
ray of different s vectors is replaced by a single one, 
representing a single transit of unit depth centered 
at n = 1. 

To compute the detection statistic z for a given 
transit pulse centered at all possible time steps, we 
can simply doubly whiten W{x(n)} (i.e. divide 
Xi(n) point- wise by of (n)), correlate the results 
with W{s(n)}, and apply the dot product rela- 
tion, performing the analogous operations for the 
denominator, noting that (n) is itself a time 
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series: 



z(n) 



N(n) 

E"j 2- min ( i ' M - 1 )[z i (n)/£?(n)] * «<(-n) 



(15) 



Note that the '— ' in s<(— n) indicates time reversal. 
The N(n) and D(n) are introduced for convenience 
later on. 

The quantity z(n) in equation (fT5"j) is referred 
to as the Single Event Statistic (SES) time series. 
Note that the quantity y/I$(n) is the epected SNR 
of the template in the data as a function of time, 
or the expected SNR of a true signal that matches 
the shape and amplitude of the template exactly. 
To make explicit the dependence on the signal am- 
plitude, A, under HI let: 



x = w + ,4s. 



(16) 



The relevant statistical quantities are then given 
by: 

(z(n)) = Ay/nfij, (17) 



(z 2 (n)) = l+A 2 B(n), 



(18) 



where, under HO, we can simply let A — >■ 0. So un- 
der either hypothesis, the SES has unit variance. 

Up until now we have only explicitly used one 
of the signal parameters, namely, the transit du- 
ration d, which is built into the template. The 
TPS module currently searches over 14 trial tran- 
sit durations logarithmically spaced between 1.5 
hours up to 15 hours. To perform the search over 
the remaining two parameters, period and phase 
(T and to respectively), we must lay down a grid 
in the parameter space that balances the need to 
preserve sensitivity to the astrophysically interest- 
ing parameter space with the need for computa- 
tional tractability. The sensitivity requirements 
dictate the spacing on the parameter space. For a 
discussion of this see (j!). To perform the search 
over period and phase, the single event statistics 
must be folded using each discrete point in param- 
eter space. In practice however, to reduce com- 
putation, a more sophisticated folding routine is 
employed that ensures all interesting portions of 
parameter space get searched while uninteresting 



portions are skipped over on a target-by-target ba- 
sis (|12h . A description of how to compute the Mul- 
tiple Event Statistic (MES), Z(t Q ,T,d), is given 
here. 

Choosing a particular point in the {T, to} space 
selects out a set, S, of P samples, one for each 
transit, that start with the sample corresponding 
to the epoch to and are spaced T samples apart. 
These samples form a subset of {n}, S = {to, to + 
T,...,t Q + (P- 1)T}. The MES is then constructed 
as: 



(19) 



ies 



ies 



This section will derive a version of the x 2 
statistic that has been found to be useful in the 
analysis of Kepler data. A brief summary of the 
formalism developed by Allen is first in order. 

The basic idea behind the construction of the 
test statistic is to break up the matched filter out- 
put into several contributions and compare each 
contribution with what is expected. What follows 
in this paragraph is taken from Q) for complete- 
ness. First, the detector output z is broken up 
into p chunks. Mathematically we have, 



E 



'3 ' 



(20) 



where the Zj are additive chunks of the filter out- 
put that when added together reproduce exactly 
the output value of the filter. These are the actual 
contributions to the filter output. Next consider 
the p quantities defined by 



Az 3 = Zj 



where 



E 



9j 



1, 



(21) 



(22) 



and the qj are the expected fractional contribution 
to z from the j'th chunk. The Azj are then the 
set of differences between the p actual contribu- 
tions and expected contributions. By definition, 



the Azj's sum to zero 



E A " 

4=1 



(23) 
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and their expectation values vanish 



so clearly we have: 



(Az,) = 0. 
The x 2 statistic is then defined as 

X 2 =£(A 2j -)7<Zi- 



(24) 



(25) 



Note that with some basic assumptions on the de- 
tector noise, the expectation value of this statistic 
is independent of whether or not a signal is present 
in the data, thereby making this an ideal discrim- 
inator for noise events. The noise considered here 
is assumed to meet the following criteria: 



(w(n)) = 0, 



(26) 



(w(n)w(m)) — 5(n — m), (27) 

where S(n) is the Dirac delta function. So we as- 
sume the whitened noise has zero mean, unit vari- 
ance, and is uncorrelated. 

This concept can now be applied to the MES 
given in (fH?)) by breaking it up into a set of P 
contributions, where again, P is the number of 
transits. The MES calculation was done in the 
wavelet domain to properly whiten the data and 
templates. Due to the imperfect localization of the 
filters in the OWT however, the wavelet compo- 
nents become correlated in an intricate way. In or- 
der to eliminate this correlation effect we can sim- 
ply apply the inverse OWT on the whitened data 
and template prior to computing the veto. This 
will also make it easier to compute the statistical 
properties of the quantities of interest as well as 
shed light on some of the subtle issues surrounding 
the calculation. The MES can be re-written as: 



Z = 



E 



3 = 1 X ' S J 

Ef=i h • s 



Ef=l E,n=lH n ) S j( n ) 



E 7 -=iE n =i 5 fW 



(28) 



where l ~' denotes a whitened vector and Sj(n) is a 
template with a transit pulse centered at the time 
corresponding to transit j € S. Now, let 



E*=i 5 ( n ) s j( r 



Efc=i E 



JV 



(29) 



3=1 



(30) 



Similarly, identify qj as 

% = 



where clearly 



En=l 5 j(") 

ELiELi 5 K n ) 

3=1 



(31) 



(32) 



The Zj are the temporal contributions to the MES 
and the qj are the fractional expected temporal 
contributions. The Azj and \ 2 statistic can be 
constructed as: 



Azj = Zj - q 3 Z 



2= y(AZjY 



(33) 



(34) 



There are three subtle issues involved in this 
calculation that have been neglected. The first is- 
sue is related to the calculation of the whitening 
coefficients, or the cr's, in (jT4"|) . The noise is esti- 
mated at each wavelet scale by a moving circular 
MAD filter. This method is robust against outliers 
but if there is a planetary transit signal in the data 
then it can perturb the whitening coefficients. The 
X 2 calculation requires that the qi components be 
explicitly independent of the presence of a transit 
signal. Therefore, prior to computing the whiten- 
ing coefficients, the in-transit samples are gapped 
and filled using an auto-regressive algorithm to 
guarantee the necessary signal independence of the 
whitening coefficients. 

To understand the next subtlety associated 
with the calculation of the veto, consider that 
under HI, the data can in general be written: 



x(n) = w(n) + As(ri), 



(35) 



where s(n) is a transit pulse train rather than 
a transit centered at some time corresponding to 
transit j as in the template version of Sj(n). This 
difference introduces correlation in the Zj's. This 
issue can be handled in a similar manner as the 
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whitening coefficients above. To get rid of this 
correlation we simply have to gap and fill all the 
in-transit samples that are not associated to the 
transit identified by j prior to computing each Zj . 
This effectively turns x(n) into Xj(n): 



Xj{n) = w(n) + Asj(n). 



(36) 



The final subtlety is that after whitening our 
template in the wavelet domain we have to zero 
out all the out-of-transit samples since the effect 
of the transit gets smeared out across more sam- 
ples even after doing the inverse OWT. In fact, 
there would not be any out-of-transit samples with 
a value of zero in the template without the win- 
dowing. This correction is necessary so that the 
Zj components achieve the correct statistical prop- 
erties and so the Azj have the correct correlation 
structure. Note also that this windowing will make 
Z 7^ MES in general. In what follows it is as- 
sumed that these subtleties are being corrected for 
as stated above. 

The relevant statistical properties of the various 
quantities of interest can now be computed and 
summarized: 

(Z) = AD, 

(Z 2 ) = 1+A 2 D 2 , 

(Zj) = q 3 AD, 



(z 2 )=q ]+q 2 A 2 D 2 , 

(ZjZk) = q 3 S 3k + q 3 qkA 2 D 2 , 
(Az 3 ) = 0, 

((Az j ) 2 )=q j (l-q j ), 

(x 2 )=P-h 
((x 2 ) 2 ) = P 2 1, 



(37) 



where, 



D 



\ 



N 



j=l n=l 



(38) 



the dij is the Kronecker delta 



and the statistical 
properties under HO can be obtained by letting 
A^ 0. 

Using the afforementioned assumptions on the 
detector noise and also assuming a perfect match 
between the signal and template, a proof was given 
in (@) that this x 2 statistic is x 2 -distributed with 



P — 1 degrees of freedon0. The cumulative prob- 
ability that x 2 < Xo i s given by: 



x^<xt 



/A 2 2 ) 



2'e 



7(1 



r(f-l) 



2' 2 ) 



du 



(39) 



where 7 is the incomplete gamma function. To 
check for this expected behavior we performed a 
100,000 sample Monte Carlo under both HO and 
HI. To speed up the test we used 1024 sample 
time series of random white noise generated with 
different noise seeds. In the run under HI we then 
injected three square waves that were 12 samples 
long and spaced 400 samples apart. Each injected 
square wave had a depth of 8/y/3. The cumula- 
tive distribution functions for each hypothesis are 
shown in Figure [T] The curves under both the 
null and alternative hypotheses match with the 
theoretical model extremely well for this Gaussian 
noise case. 

Note that this version of the x 2 discriminator 
has been referenced in Appendix A of (fl2[). There, 
it is referred to as Xpv The results presented in 
that work however use an older version that had 
not been fixed to account for the subtleties men- 
tioned above. The version presented in that pa- 
per was also cast in the wavelet domain, which is 
flawed due to the correlation introduced as men- 
tioned above. Results given here in section |8] are 
obtained with an updated codebase that takes into 
account all the subtleties and uses this re-cast time 
domain version. To allow for easier comparison, 

we will refer to this version of the x 2 m subse- 

2 

(2)- 



quent sections of this paper as x 2 



4. Other x 2 Tests 

The x 2 statistic presented in section [3] involves 
splitting up the MES into its temporal contribu- 



(0), the unequal expected SNR interval case was treated 
with the additional assumption that the signal and tem- 
plate did not match exactly. In that case the distribu- 
tion of x 2 was proven to be a non-central x 2 distribution 
with P — 1 degrees of freedom and a non-centrality param- 
eter that was proportional to the mismatch and the SNR 
squared. Since we have assumed zero mismatch between 
the signal and template, the non-centrality parameter is 
zero and therefore the non-central x 2 distribution becomes 
the (central) x 2 distribution. 
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CDF for x 2 Statistic 




X 2 Threshold 

Fig. 1. — Cumulative Distribution of the x 2 
statistic resulting from the 100,000 sample Monte 
Carlo under HO and HI with Gaussian noise. Time 
Series of length 1024 samples were used. Injected 
transits had a duration of 12 samples and were 
spaced 400 samples apart. The model here is a x 2 
cumulative distribution function for a % 2 with two 
degrees of freedom. 



tions. In addition to this version, several other 
versions have also been explored that break up the 
detector output in other ways. One such alternate 
method analyzes the wavelet contributions to the 
SES. Starting with (TTSl) . we can identify the z% and 
qi wavelet components as: 



Zi(n) 



and 

1i{n) 



2- min ( i ' M -V{xi[n)/a? (n)] * Si(-n) 

(40) 



gi(n) 
B(ra) 

2- min ( i - M - 1 )gr 2 (n)*3?(-n) 

(41) 

where now the Zj(n) are the actual contributions 
the the SES time series from the i'th wavelet 
component and qi(n) are the corresponding ex- 
pected contributions. The Ni(n) and Dj(n) are the 
wavelet components of the previously defined N(n) 
and H)(n). Now the \ 2 statistic can be formed: 



Azi(n) = z l {n) - qi(n)z{n) 



M 



x 2 w = E 



[Azj(n)] 



(42) 



(43) 



where M is the number of wavelet scales and is 
determined by the number of data samples N and 
the length of the mother wavelet filter chosen to 
implement the filter bank. This statistic would be 
X 2 distributed with M — 1 degrees of freedom if 
there was no overlap between the wavelet scales. 
Since there is overlap however, using this statistic 
for vetoing purposes can be dangerous because the 
overlap is difficult to model. We have a value for 
this statistic at each j £ S, so we can form a 
coherent statistic by adding up the P points that 
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contribute to the MES. This gives 



xh - Ex 2 U) 

M 



EE 

jes »=i 



[A^(j-)] s 
QiU) 



(44) 



The degrees of freedom in the perfect case with 
no overlap between the wavelet scales would then 
become P{M — 1). In reality however, the overlap 
lowers the degrees of freedom and alters the corre- 
lation structure of the Az^s thereby skewing the 
distribution form. This version of the statistic has 
been referenced in Appendix A of {HI). There it 
was referred to as Xi\\ as it i s now being refferred 
to in this paper. There is no difference in the way 
this version was formulated in (|12|). 

In a similar way, we could also analyze the 
wavelet contributions to the MES. 



Q, 



(45) 



(46) 



where Zi are the actual wavelet contributions to 
the MES and the Qi are the expected wavelet con- 
tributions. Now, x 2 can be constructed: 



AZj — Zi — QiZ 



M 



AZ? 



X 



i=l 



(47) 



(48) 



where now Z is the sum over i of Z{ in equa- 
tion (|45p. Again however, this statistic is not \ 2 ~ 
distributed with M — 1 degrees of freedom due 
to the overlap between the wavelet scales. This 
statistic has not proven to be very useful, so it is 
being omitted from the numbering scheme. 

A version more akin to a classical \ 2 statistic 
can be formulated by simply comparing each SES 
that contributes to the MES with what we expect. 
The observed value for each SES is given by equa- 
tion (|14p , where we can make it more explicit that 
the transit pulse is centered at some time n = j 



where j € S. To avoid the subtle correlation is- 
sues that arise in the wavelet domain we also use 
whitened time domain vectors: 



x • s,- 



(49) 



The expected value is given by equation Q17[). 
again with the pulse centered at a specific n = j: 



j - 



(50) 



where A is the signal depth which can be esti- 
mated by robustly fitting the full whitened trial 
pulse train to the whitened data. With these com- 
ponents we can construct the classical \ 2 as: 



*(3) " (77) 
j= i \ J/ 



(51) 



Other versions can be formulated that analyze 
the contributions to the SES and MES from each 
in-transit sample as well. These tend not to work 
well however since the number of degrees of free- 
dom can become very high, thereby diluting the 
effect of any glitch that might be causing problems 
for the detector. Of course, still more versions can 
be formulated by mixing these up in all possible 
permutations, for example, analyzing the wavelet 
contributions to the in-transit samples. 

5. Effect of Signal/Template Mismatch 

Up to this point it has been assumed that the 
true astrophysical signal matches exactly with one 
of the templates in the template bank. In TPS cur- 
rently however, this is never the case. The Tran- 
siting Planet Search currently uses as its model a 
square wave pulse train that is parameterized by 
three parameters, namely, epoch or phase, period, 
and duration ({to,T, d} respectively). A discrete 
grid of templates on this three dimensional param- 
eter space is used for the search, so even if a true 
astrophysical signal were square wave in shape, 
there would be mismatch induced by the discrete 
nature of the search grid. In transit duration, TPS 
uses a logarithmically spaced set of 14 different 
values from 1.5 to 15 hours. For period and epoch, 
there is a minimum correlation requirement of 0.9 
between neighboring templates that is used to de- 
termine the spacing. Periods are searched in the 
range from a half a day out to half the length of 
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the data since we currently require 3 transits for a 
detection. The epoch is then searched over [0,T]. 
The mismatch induced by using this discrete three 
dimensional grid then adds to whatever inherent 
shape mismatch there is between the square wave 
pulse and the true astrophysical signal. The size 
of the total mismatch drives down the signal-to- 
noise ratio (SNR). This gives rise to a sort of bal- 
ancing act between detection efficiency and com- 
putational time or tractability of the search. For 
a discussion of the correlation coefficient and how 
it relates to the match between neighboring tem- 
plates see (@) or (HI). 

When a target is identified by TPS as contain- 
ing a threshold crossing event it gets sent on to the 
Data Validation (DV) portion of data processing 
pipeline. Here, a model fitting algorithm fits an 
astrophysical model to the light curve. The astro- 
physical model employed is the geometric transit 
model of Mandel and Agol |l3). The limb darken- 
ing is taken into account using the non- linear limb 
darkening model of Claret (|l4 ) which depends on 
some stellar parameters for the target star, namely 
its effective temperature, metallicity, and surface 
gravity. This level of complexity is required to 
produce the correct transit shape, which is not a 
square wave. In what follows we will assume that 
the true signal is perfectly described by this transit 
model pulse train that DV produces. 

Here we will consider only how the mismatch 
between signal and template can affect the distri- 
bution of X( 2 ) °f section [3j Recall that xp) ex ~ 
amines the temporal contributions to the MES. 
Effectively then, each piece of the statistic is inde- 
pendent of the period since they are all treated in- 
dividually. The mismatch in period simply causes 
the timing offset for a given transit to be a function 
of the transit time. So the period mismatch can 
just be lumped together with the epoch mismatch 
when we consider how much an individual transit 
can potentially be mismatched in time from the 
real transit. 

To begin, let s and s' denote the whitened, win- 
dowed, template and true astrophysical transit re- 
spectively, centered at some arbitrary time. Now 
under HI the true signal is present in the data: 



x'(n) = w(n) + As'(n), 



(52) 



where the SES is now given by: 
SES = XJL 



ys • s 

[w • s + As' ■ s] 



(53) 



Now Schwartz's inequality can be used to bound 
the inner product of the template and signal vec- 
tors: 



(s'-i) 2 < (§.s)(s'.§'), 



(54) 



or in terms of the associated unit vectors we have 



(S 



< 1. 



(55) 



So the dot product of the unit vectors has to 
be in the range [—1,1]. As in (6) , this quantity 
is often referred to as the fitting factor (and is 
directly related to the correlation coefficient dis- 
cussed above). Following (0), we can let 



cos 8, 



(56) 



or, adding back in the dependence on the normal- 
ization: 

s' ■ s = cos#Vs 



(57) 

where, without loss of generality, we can restrict 
9 to be in the range [0,7r/2]. Now the template 
mismatch e is given by: 



cos9 = 1 — e. 



(58) 



The mean and variance of the SES given in 
equations (| 1 T[) and ([18]) can now be re-computed 
to account for this mismatch: 



{z(n)} = A cos 9 y/W(n), 



(59) 



(z 2 (n)) = 1 + A 2 W{n) cos 2 9, (60) 

where now D' is D with s replaced with s' . Now it 
easy to understand the significance of the fitting 
factor. The optimal SNR is reduced by this factor 
when the data is filtered with a template s that 
does not match the true signal s' exactly. 

This analysis can easily be extended now to the 
MES by considering the full set of pulses. Sum- 
ming over the pulse set Sj is equivalent to simply 
using the full pulse train s since each pulse is being 
windowed: 



p 



(61) 
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(62) 



p p 
s-s= (E^ME^ 

i=l 3=1 

where the Kronecker delta is used since the set 
of pulses are completely uncorrelated. This then 
gives: 



§ ■ 1 = E § j ■ ■ ^ 

and substituting into (1571) gives: 



(63) 



E*; 

i=i 



E> 



(64) 



If we consider just the j'th temporal contribution 
then the total fitting factor must be divided up: 



§'■ • Sj = Xj cos ( 



Ev«: 



\ 3=1 \ 3=1 



E ! 



(65) 



where the Xj are a set of P real constants satisfy- 
ing: 

p 

5>i = l. (66) 

3=1 

This shows quite simply that each temporal contri- 
bution has its own portion of the total mismatch of 
the pulse train and that they can all be different. 
This makes sense since, even if the shape of the 
transits in the pulse train are not changing, a pe- 
riod mismatch between the model and true signal 
will easily make the fractional mismatch depend 
on transit time. 

We are now in a position to re-calculate all the 
statistical properties of the relevant quantities as 
in equation (f3"Tl) : 

(Z) = .42? cos (9, 

(Z 2 ) = 1 + A 2 V 2 cos 2 e, 

(zj) = XjAD cos 0, 

(z 2 ) = q 3 - + X 2 A 2 V 2 cos 2 0, 

(zjZk) = qjSjk + XjXkA 2 V 2 cos 2 6, 

(Azj) = (Xj - qj) AD cos 0, 

((Azj) 2 ) = qj (l - qj) + (Xj - qj) 2 A 2 V 2 cos 2 0, 

(67) 



where again, 



V 



\ 



P N 

EE*» 

3=1 n =l 



Now the expectation value of the \ 2 is given by: 

= P - 1 + kA 2 V 2 cos 2 
-- P-1 + k(Z) 2 , 



(x 2 ) = 

where k is given by 



(69) 



p 

E 

3 = 1 



(X 3 - qjf/qj 



-1 



p 



_X 2 /qj. 

3 = 1 



(70) 



The variance of the \ 2 is then given by: 

- 2 = ((x 2 ) 2 ) - (x 2 ) 2 

= 2(P- 1) +4k(Z) 2 . 



(71) 



So K is a parameter depending on the degree of 
mismatch as well as the noise and is manifestly 
positive. In (6i a proof was given that this \ 2 
actually has a non-central x 2 distribution with P— 
1 degrees of freedom and non-centrality parameter 
given by k(Z} 2 . 

As in (jcj), we can obtain an upper limit on k 
from Schwarz's inequality: 



■ h) 2 



ft 

X 2 cos 2 



< (i'j ■ s'j)(ij ■ 



< 
< 



q 3 (s' 



cos^ J J 



(72) 



Now, summing both sides over j and using (|70|) . 
we have: 

< « < -4n; - !• (73) 
cos^ 

Since we have no prior knowledge of the mis- 
match between the true astrophysical signal and 
the template, we have no choice but to pick a suit- 
able value for the purpose of estimating the non- 
centrality parameter. To this end, a Monte Carlo 
study has been performed to get an estimate of 
the integral average of the signal/template mis- 
match. The true astrophysical signal is generated 
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exaclty as it is described above for DV, namely, us- 
ing the Mandel-Agol geometric transit model with 
the non-linear limb darkening of Claret. The im- 
pact parameter has been sampled uniformly in the 
range [0, 1], the model transit durations have been 
sampled uniformly in the range [1.5, 15], the tran- 
sit depths have been sampled logarithmically (to 
favor weaker signals) in the range [10 -4 3 , 10 -15 ], 
and the stellar parameters used for limb darken- 
ing are pulled randomly from the set of 180, 000+ 
targets in the Kepler Input Catalog (KIC) that 
are routinely searched. I have assumed the noise 
is WGN. 

For this study, we focus on a single transit 
pulse rather than the full pulse train. However, 
the potential mismatch in period has been used 
to expand the range on the allowable epoch mis- 
matches, so in effect it is being taken into account. 
In TPS, the allowed period mismatch is 

AT = 4(1 - p)d/N, (74) 

where p is the correlation coefficient (set to 0.9), 
d is the transit duration, and N is the number of 
transits. The epoch mismatch is then randomly 
sampled between: 

-4(1 - p)d/3 < At < 4(1 - p)d/3, (75) 

where d is the pulse duration in the set that TPS 
uses that most closely matches the randomly se- 
lected model value and N is set to 3 here since 
TPS requires a minimum of 3 transits. The epoch 
and transit duration mismatches have both been 
turned off, simultaneously, turned on, and also 
tested individually. 

Two sets of pulse shapes have been used. One 
set is comprised of the square wave pulses that 
TPS currently employs. The other set has been 
generated by using this Monte Carlo framework 
to build a set of normalized templates, averaged 
over randomly selected astrophysical models taken 
from the parameter space of interest mentioned 
above. This set of templates should on average 
have the best possible match to some random, true 
astrophysical signal. Directly searching over the 
astrophysical parameter space in TPS would add 
too much additional volume to the search param- 
eter space and would push us outside the compu- 
tational realm of feasability with our current re- 
sources. However, using these astrophysically mo- 
tivated templates, that have been averaged over 



the parameter space, will be the subject of an in- 
vestigation in the near future and could help us to 
achieve better detection efficiency and may also 
improve the vetoing power of the x%) ve t° (more 
details in section [5]). The results of this study are 
given in Table [1] 

From the table it is evident that the mismatch 
in transit duration is a small effect whereas pe- 
riod/epoch mismatch raises the average consis- 
tently by 2 — 3%. Even with the astrophysical 
model the mismatch for a single pulse is on av- 
erage around 4.3%, which is expected given that 
p = 0.9. This can be lowered by moving to a 
smaller spacing along the period dimension of pa- 
rameter space at the price of increasing the com- 
putational time. This may prove to be better than 
searching over the space of true astrophysical mod- 
els since that would require increasing the dimen- 
sionality of the parameter space. This will be the 
subject of future investigation. It is clear from the 
results however, that using the set of averaged as- 
trophysical templates cuts the mismatch down by 
about a factor of 2. 

6. Thresholding Conditions 

If the signal and template matched perfectly, 
then the results of section [3] show that our test 
statistic would be x 2 distributed. In this 
threshold could be set by simply evaluating the x 2 
cumulative distribution function. In reality how- 
ever, there is mismatch between the signal and 
templates, so our test statistic has a non-central 
X 2 distribution. The threshold, x*> that we choose 
then depends on the expected SNR and the mis- 
match. Still, if we had a perfect understanding 
of the mismatch then we could set a reasonable 
threshold by evaluating the non-central x 2 cumu- 
lative distribution function. Since we have no prior 
knowledge of the true astrophysical signal how- 
ever, one way to proceed, as described in the pre- 
vious section, is to understand what the integral 
average of the mismatch is and try and minimize 
it in any way possible (e.g. by using astrophys- 
ically motivated templates). Clearly, we need to 
allow some room for the true spread in mismatch 
when we set our threshold. The threshold for each 
candidate event will then be a function of the num- 
ber of degrees of freedom and the non-centrality 
parameter. 
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Table 1: Signal/Template Mismatch Results 





Square Wave Model 


Astrophysical Model 




Ad 




5e(%) 


e{%) 


Se(%) 


N 


N 


3.91 


0.013 


1.49 


0.0032 


Y 


N 


6.88 


0.021 


4.14 


0.018 


N 


Y 


4.32 


0.015 


1.66 


0.0051 


Y 


Y 


7.21 


0.019 


4.31 


0.015 



Note. — Results for the signal/template mismatch Monte Carlo study. Each result was generated with 50,000 trials. The <5e 
is the error in each mean mismatch value, e, given by tr/yfN {N being the number of trials or samples). 



In f6|), one approach suggested is based around 
the fact that near the distribution maximum, 
when the non-centrality parameter is large com- 
pared to the degrees of freedom, the non-central 
X 2 distribution can be approximated by a gaussian 
of width er, where a is the standard deviation of 
the non-central x 2 distribution. So in that case, 
it is suggested that a reasonable threshold would 
be: 

xl = {x 2 )+P°, (76) 
where p is a parameter that can be tuned empir- 
ically but would typically be something like 5. 
This sort of threshold has been investigated for 
use in TPS and is a subject of continued work. 
To date however, this method has not been suc- 
cessful, largely it is thought due to the variance of 
the set of true mismatches. So using this sort of 
thresholding scheme may hinge on our ability to 
lower the signal/template mismatch as described 
previously. 

An alternative thresholding method that seems 
to work well has additionally been investigated 
and is mentioned in (|6j) and described for TPS 
in (|12i ). For completeness, it is now reproduced 



here. The dependence of the mean of x%\ on the 
degrees of freedom can be eliminated by dividing 
it out and forming what is commonly referred to 
as the reduced x 2 : or : X 2 (assuming the signal and 
template matches perfectly and we have a x 2 dis- 
tribution rather than the non-central x 2 discussed 
above) : 



Xr 



x 



(77) 

P-l K ' 

After doing this the expectation value becomes 
unity 



Now we could simply choose a value of x 2 to thre- 
hold on empirically by testing it on prior search re- 
sults. While doing this testing however it was no- 
ticed that there is some small advantage to thresh- 
olding on a different quantity that essentially con- 
verts the quantity into units equivalent to a. The 
study was done using some of the earliest x 2 re- 
sults from the pipeline and has since been revisited 
now that all the subtleties pointed out in section [3] 
have been addressed. The quantity we currently 
threshold on in the pipeline is given by: 



)]* 



MES 

y^Cr 

MES 

Xr 



(79) 



(xi) 



1. 



(78) 



We currently use a value of 7.0 for 77* (note that 
the MES threshold is set to 7.1 based on a per- 
formance study in (4)). We currently use both 
Xi\) an d X( 2 ) f° r ve l° m g purposes in this manner. 
Since the Xm formalism is flawed by the fact that 
there is overlap in the wavelet components, we are 
working to replace that version with X 2 $) ■ For this 
reason, examples below will only be given for x%) • 

7. Examples 

Since the method of thresholding, the model 
pulse shape, and the versions of the x 2 statistic 
used for vetoing are a subject of ongoing work, 
the goal here is to present some basic examples 
that utilize the current state of the art. Begin by 
injecting three square wave pulses, each with a du- 
ration of 12 samples, equidistant from one another 
in a 1,024 sample time series. The square waves 
will be injected on top of zero mean, unit variance, 
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Gaussian noise and will each have a depth of 5a. 
Figure [2] shows the whitened data chunks Xj and 
the corresponding Zj and qj for each transit. For 
this example, the MES given by equation (TT9")) is 
19.1 whereas the sum over all the Zj is 27.76. The 
Zj components and qj components, given by sum- 
ming over the points in the corresponding plots, 
are: 

zt = 9.33, q x = 0.334 
z 2 = 9.03, q 2 = 0.332 
z 3 = 9.32, q 3 = 0.333 
Z = zx + z 2 + z 3 = 27.76 . (80) 

Note that these values have been rounded. The 
X 2 can then be computed: 

3 

X( 2 ) = 5>i - loZflti = °- 124 . (81) 
j=i 

with an associated probability computed by equa- 
tion ([391): 

^>0.124 = 1-P^<0.124 = 94% , (82) 
and rj given by: 

n = MES/xr = 76.6 . (83) 

So this is a very simple detection as it should 
be, but it serves to illustrate how the calculation 
works. 

Now, the same three square waves will be in- 
jected except that the last one will have its du- 
ration halved by a factor of two. The results ob- 
tained are now: 

zi = 9.22, qt = 0.334 
z 2 = 8.89, q 2 = 0.332 
z 3 = 4.43, q 3 = 0.333 
Z = Zl +z 2 + z 3 = 22.54 . (84) 

The MES is then 17.05 and the x 2 , probability, 
and r\ can then be computed: 

3 

X(2) = - ^Zf/lj = 42.89 , (85) 

P X =>42.89 = 1 - iV<42.89 = 4.8 X 10~ 8 % , (86) 



rj = MES/xr = 3.68 . (87) 

This illustrates how sensitive the veto is to mis- 
match in signal and template. 

As another example, the same three square 
waves will be injected except the last one will now 
have its amplitude reduced by a factor of two. The 
results obtained are now: 

zi = 8.80, qi = 0.335 
z 2 = 8.50, q 2 = 0.332 
z 3 = 4.30, q 3 = 0.334 
Z = zi + z 2 + z 3 = 21.60 . (88) 

The MES is then 16.28 and the x 2 , probability, 
and rj can then be computed: 

3 

X( 2 ) = $>i - QjZf/Qj = 38.20 , (89) 

^>38.20 = 1 - ^<38.20 - 5.07 X 10^% , (90) 

rj = MES/xr = 3.73 . (91) 

So the veto is sensitive to a mismatch in depth of 
one of the pulses as well. 

A logical extension of this would be to ask: how 
do these numbers change when there are more 
transits and still only a single transit is perturbed 
in some way? When the degrees of freedom get 
large enough, the effect will get washed out to the 
point that the veto will no longer work. This be- 
havior is acceptable in this case however, because 
if there are say, 10 transits, and only one of them 
is perturbed in some way, we would still have rea- 
sonable belief there is something interesting in the 
data. Clearly though, if many of the transits do 
not match well then the veto will work as it should. 

8. Results 

As described in (fl2j). events that pass the MES 
threshold of 7.1 in TPS arc then subjected to both 
a robust statistic test as well as the two x 2 tests 
mentioned in section in that order. Recently, 
TPS was run over a set of 192, 312 targets, (known 
eclipsing binaries were all removed), with quar- 
ters Ql ~ Q12, and with the x 2 thresholds turned 
down to gain an understanding of their operat- 
ing characteristics. The set of Kepler Objects of 
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Interest (KOI's) was used to understand the de- 
tection probability. The set of KCTs is comprised 
mostly of known planets as well as planet can- 
didates which have not gone through the vetting 
process. A set of 2,211 KOI stars, the best of 
the best, were set aside to examine the detection 
probability. To understand the false alarm proba- 
bility, the full set of 3009 unique KOI target stars 
were removed from the total set of targets, the re- 
mainder of which forms the false alarm population 
of targets. Of course, some of these false alarms 
may actually be true positives, but it is thought 
that the number is fairly low. It should also be 
noted that the false alarm and detection probabil- 
ities gleaned by a study of this sort are only to be 
used as an indicator since the true pipeline version 
of TPS allows for searching over many period and 
epoch combinations, for each target, that produce 
a MES above threshold. Here we have limited the 
number of searched combinations to only one per 
target. 

After applying the MES threshold of 7.1, there 
are 14 targets that get filtered out of the 2, 211 
KOI's, giving a 99.4% detection probability. There 
are then 86, 737 non-KOI targets that produce 
"false alarms" , for a false alarm rate of 45.8%. Ap- 
plying the robust statistic threshold of T) = 6.4 fil- 
ters out an additional 13 of the 2, 211 KOI's for an 
overall detection probability of 98.8%. The num- 
ber of non-KOI targets producing false alarms is 
reduced to 40, 880, for an overall false alarm rate 
of 21.6%. Now we will just use these two sets of 
remaining targets namely, the set of 2, 184 KOI's 
and the set of 40,880 non-KOI's, to explore var- 
ious thresholds on r)i m ) = MES/x(m)r- Figure [3] 
shows the operating characteristic curve for each r\ 
individually. You can clearly see that 7?( 2 ) achieves 
the best detection efficiency but would still, for 
some reasonable detection probability, let through 
an unacceptable number of false alarms if used 
alone. In figure |4l the corresponding false alarm 
and detection probabilities are plotted as a func- 
tion of threshold 77* • 

As mentioned in (fill ). TPS currently uses rjm 
and 77(2) for vetoing purposes, the threshold for 
both set to 7.0. Using 77(3) instead of 77m is the 
subject of ongoing work but is desirable due to 
the fact that the Xm formalism is flawed for rea- 
sons mentioned previously. On this data set, using 
the current TPS thresholds gives an overall (MES, 
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Fig. 2. — x, Zj, and qj for three injected 5<r square 
waves in Gaussian noise. 
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three vetoes of interest. 
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robust statistic, rym, and 77(2) ah being used) de- 
tection probability of 97% and an overall false 
alarm probability of 3.77% or 7,133 false alarm 
targets. If we were to replace rjm with V(3)i an d 
require the same detection probability, then the 
threshold for 77(3) would have to be set at 4.8 and 
would give an overall false alarm probability of 
4.89%, or 9, 260 targets. Further investigation re- 
veals that there is an issue with the fitted depth 
A being used in the calculation of rjr^y The fit- 
ted depth comes from the robust fit of a whitened 
model pulse train to the whitened data when the 
robust statistic is being calculated. This value can 
be perturbed by the robust statistic algorithmn 
however, since it has some machinery built in that 
allows it to deemphasize cadences based on the 
robust fit weights. This can throw off the compar- 
ison between the observed and expected values. 
To address this problem, and hopefully improve 
the detection efficiency of 77(3) enough to make it a 
more suitable replacement, the fitted depth is now 
being calculated by: 



A 



(92) 



We await future results. 

Another recent run of TPS was done over the 
same set of quarters and 192, 255 targets. In this 
run however, the full looping machinery was em- 
ployed so that TPS was examining up to 1,000 
events for each target that produced a sufficiently 
high MES. In this set of targets there were 2, 264 
targets with high quality KOFs, and 3,008 total 
KOFs in the sample. There were 13, 570 targets 
producing Threshld Crossing Events (TCEs) . Our 
detection probability in this run was 96% whereas 
the overall false alarm probability was 5.6%. In 
this run, using 77(1) and r)( 2 ) with thresholds both 
at 7.0 caused 2%, or about half, of the total loss 
in detection probability. Using the vetoes however 
dropped the number of false alarms from 55,233 
down to 10, 694, or from 29% overall false alarm 
rate down to just 5.6%. 
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Fig. 4. — False alarm and detection probabili- 
ties corresponding to the operating characteristic 
curves in figure [3j 
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9. Conclusion 

This paper extends the conceptual framework 
in (@) so that the formalism can be applied to 
the case of interest here, namely, in vetoing false 
alarms coming from the Transiting Planet Search 
component of the Kepler data processing pipeline. 
We have presented a set of potential vetoes and se- 
lected two, x%) anc ^ ^(3)' to b e the subject of fu- 
ture work and development based upon the results 
presented here. These vetoes cut down the false 
alarm rate considerably and are crucial in main- 
taining a large enough search parameter space and 
detection probability. Further mitigation of some 
of the known sources of systematic error (image ar- 
tifacts, etc), combined with improvements to this 
set of vetoes (using better templates and switch- 
ing to the method of thresholding described in sec- 
tion[6]) are other avenues being pursued to improve 
detection efficiency. 

The author wishes to thank Bruce Allen for the 
original work on this subject from which most of 
this work is based. Kepler was selected as the 10 th 
mission of NASA's Discovery Program. Funding 
for this work is provided by NASA's Science Mis- 
sion Directorate. 
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